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Abstract: As the use of camera networks has expanded, camera placement to satisfy some 
quality assurance parameters (such as a good coverage ratio, an acceptable resolution 
constraints, an acceptable cost as low as possible, etc.) has become an important problem. 
The discrete camera deployment problem is NP-hard and many heuristic methods have 
been proposed to solve it, most of which make very simple assumptions. In this paper, we 
propose a probability inspired binary Particle Swarm Optimization (PI-BPSO) algorithm to 
solve a homogeneous camera network placement problem. We model the problem under 
some more realistic assumptions: (1) deploy the cameras in the 3D space while the 
surveillance area is restricted to a 2D ground plane; (2) deploy the minimal number of 
cameras to get a maximum visual coverage under more constraints, such as field of view 
(FOV) of the cameras and the minimum resolution constraints. We can simultaneously 
optimize the number and the configuration of the cameras through the introduction of a 
regulation item in the cost function. The simulation results showed the effectiveness of the 
proposed PI-BPSO algorithm. 

Keywords: camera network placement; coverage; particle swarm optimization (PSO); 
task constraints; camera constraints; scene constraints 



1. Introduction 

Camera networks are used in many novel applications, such as video surveillance [1], room sensing [2], 
smart video conferencing [3], etc. There are some challenge issues in the study of camera networks, 
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such as how to get an optimized camera network coverage, how to design a scalable network 
architecture, how to determine the trade-off between QoS requirements and energy costs [4], etc. 
Among the problems mentioned above, camera network coverage problem is a central issue, which has 
interested many researchers [5-9] and coverage rate is one of the most important performance metrics 
for camera network surveillance utility. Thus, determining the appropriate placement of the cameras 
to achieve the maximum amount of visibility becomes an important issue in designing camera 
network arrangements. 

The camera network deployment problem can be defined as how to place the cameras in the 
appropriate places to maximize the coverage of the camera network under some constraints. The 
constraints can be categorized into three main types: task constraints, camera constraints and scene 
constraints. The task constraints include continuous tracking (enough overlap between cameras), 
people identification (image resolution and focus), complete coverage of the surveillance area (field of 
view of each camera) and so on. The camera constraints include the camera network type (a 
homogeneous or heterogeneous camera network), camera type (PTZ or static camera), camera intrinsic 
parameters (focus length, CCD size, etc.) and so on. The scene constraints include the surveillance area 
(2D or 3D, with or without holes, simple polygon or not), the positions where the camera network can 
be located (2D or 3D, in the walls, in the ceiling, at the same height or anywhere) and so on. 

Since the camera network placement problem is a NP-hard combinatorial optimization 
problem [10], simple enumeration and search techniques will meet great difficulty in determining 
optimal placement configurations. There are many approximate techniques for solving optimal camera 
placement problems, such as the greedy based method, sampling based methodm etc. Zhao et al. have 
provided an excellent survey of the approximate techniques in [11]. Lately, some researchers have 
proposed some evolution-based optimization methods, such as PSO [12], BPSO-PI [13] and ABC [14]. 

There are some weaknesses in these results, mainly due to the overly simple assumptions used. 
From the perspective of the three constraints mentioned above, we give a brief explanation of the 
limitations of the earlier works. From the perspective of task and camera constraints, most of the works 
only consider the coverage of the area while the video resolution and focus are seldom considered; 
From the perspective of scene constraints, most of the scenes are modeled as a 2D case which is too 
simple to conduct the real camera network placement, or modeled as a 3D case which is too restrictive 
because in most of the cases we are only concerned with the surveillance plane area. 

We give several examples. The surveillance area of [13] is modeled as a rectangle in the 3D cases 
while we know that in the real circumstances it is a trapezoid which is sensitive to the orientation of 
the cameras. The constraints in [14,15] only include the coverage rate (FOV is considered), while the 
resolution and focus are out of the scope of the articles. 

In this paper, we consider the deployment of homogeneous camera network in the 3D space to 
surveil a 2D ground plane. For simplicity considerations, the surveillance plane is modeled as a 
rectangle area which is not essential to our work. We separate the surveillance plane area into n grids, 
as illustrated in Figure 1. We assume that the probability of choose each grid is the same l/n and the 
coverage ratio p can be determined by sampling as illustrated in the next section. 
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Figure 1. The surveillance plane area is divided into n grids no matter the shape of 
the area. 
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We take a more synthetic constraints set, including the surveillance video resolution, video focus, 
the camera field of view etc., into consideration. Under the constraints, we propose a probability-inspired 
particle swarm optimization algorithm to get the optimized camera network placement configuration. 

The main contributions of this paper can be summarized as follows: 

- We consider a more realistic problem in that we deploy the cameras in a 3D space to surveil a 
plane area. Some of the previous works consider the problem in a 2D plane and the FOV of the 
camera is modeled as a sector which is too simple an assumption, while some works consider the 
problem in the 3D space and model the FOV of the camera as a cone which is too restrictive an 
assumption. We can get a more accurate result to solve the camera deployment problem in the 
3D space to surveillance of a 2D plane and instruct the real life camera network placement; 

- We take more constraints into consideration than others, including resolution, focus, FOV. Most 
of the previous works only consider the FOV of the camera to get a good coverage while the 
other constraints are important to get a good surveillance video; 

- We propose a probability-inspired PSO algorithm to solve the camera network placement 
problem heuristically. In the algorithm, we introduce a regulated item in the fitness function to 
optimize the coverage ratio and the number of the cameras simultaneously. The experimental 
results show the effectiveness of the algorithm. 

The rest of the paper is organized as follows: we review recent progress in camera network 
deployment in Section 2. In Section 3, we give the camera network placement problem from three 
perspectives. In Section 4, we propose a PI-BPSO algorithm and discuss the representation and fitness 
computation of particles in the algorithm. Simulation results are given in Section 5. We give the 
conclusions and discuss some future work in the last section. 

2. Related Works 

In Computational Geometry, there is a well-known problem called the Art Gallery problem (AGP) [16] 
and some of its variations which are very similar to the problem of camera placement in camera 
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networks. The aim of the camera network deployment is to provide full coverage of the surveillance 
area with the minimal cost, and the aim of the AGP is to monitor an art gallery with the least number 
of guards located at different locations in order to make sure that every point in the museum is seen by 
at least one guard. The difference between the two problems is mainly attributable to the assumption 
on the ability of the "guard" as the AGP assumes the guards have unlimited field of view, infinite 
depth of field, and have infinite precision and speed, while the cameras in the real world don't have 
these abilities. Even if we make the unrealistic assumption of the ability of the guard, the problem is 
still proved to be NP-hard [17,18]. Though the AGP and its variants can't give an exact answer to the 
camera network placement problem, they do give some insights into the problem, the visibility graph 
and the lower bound of the guard numbers. There are some good results in the AGP field which we 
will not introduce here in detail. We refer the reader to an excellent book on the topic [16]. 

The other interesting research problem related to the camera network placement problem is the 
wireless sensor network (WSN) placement problem [4,7,19-22]. To be strict, the camera is known as a 
visual sensor, but because the camera is a kind of directional sensor, this leads to some differences 
between the two disciplines. We refer the reader to the review [6] about the WSN placement problem. 

Researches on the camera network placement problem can be divided into two disciplines, one is 
the coverage problem and the other is the optimization problem. The two problems can be related 
through the optimization framework issued by Zhao et al. [11], where they present that the camera 
network placement can be divided into two broad categories, the MIN and FIX problems. The MIN 
problem is to find the minimum cost cameras to satisfy the minimum coverage ratio, and the FIX 
problem is to find the maximum the coverage subject to a fixed number of cameras. 

The visual coverage of the camera network describes what can be seen and what can't in the 
surveillance area. It is so fundamental to many computer vision tasks that different works have 
suggested different visual coverage models according to the different surveillance tasks. 

The standard coverage model is defined as the area of the surveillance area. It can be classified as 
two cases, from the perspective of the area and the camera. From the perspective of the area [13,23], 
the surveillance area is discrete into some grid and if the center point of a grid can be seen from a 
camera (under some limitation such as resolution and DOF etc.), then we say that the grid can be seen 
from the camera. From the perspective of the camera [24,25], the area that a cameras can monitor is 
determined by the camera's FOV, DOF, resolution and the camera network placement problem is 
turned into a set cover problem. To get a continuous consistently labeled trajectory of the same object, 
Yao et al. [26] add handoff rate analysis to the standard coverage models. To improve the full 
coverage of events and objects/events recognition, Newell et al. [27] give an multi-perspective 
coverage (MPC) model where the coverage is calculated based on the co -perspective coverage 
(the number of perspectives that cover the event). Based on the MPC model, Yildiz et al. [8] give an 
angular coverage model where the coverage of an object is defined as the object can be seen from 
different perspectives that span 360°. We refer the reader to the excellent survey [5]. Most of the work 
model the area as a plane area and the camera is placed in the plane which is too simple to guide the 
real camera network placement. 

As we have stated that the camera network placement problem is NP-hard, researchers have 
put forward various approximate optimization algorithms to solve the problem [11,13-15]. 
Chrysostomou et al. [14] propose a bee colony algorithm as the optimization engine to determine the 
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minimum possible cost (minimum number) of cameras to cover the given space under some camera 
placement constraints such as geometrical, optical, as well as reconstructive limitations and this 
delivers promising preliminary results. Morsly et al. [13] propose a Binary Particle Swarm 
Optimization Inspired Probability (BPSO-IP) algorithm as the optimization engine to ensure the 
accurate visual coverage of the monitoring space with a minimum number of cameras. The authors 
also give a detailed comparison between the BPSO-IP algorithm and other evolutionary -like 
algorithms such as BPSO, Simulated Annealing (SA), Tabu Search (TS) and genetic techniques based 
algorithms to solve the camera network placement problem. Lee et al, [15] give a generic algorithm as 
the optimization engine to solve the camera network placement problem which is modeled as a multi 
objective optimization problem. Zhao et al. [11] put forward a framework to compare the accuracy, 
efficiency and scalability of the greedy-based method, heuristics-based method, sampling-based 
method and LP and SDP relaxation-based method. 

3. Problem Definition and System Model 

We put forward an optimal camera network placement problem to satisfy the need of different 
surveillance tasks on a specific surveillance area. The problem can be modeled as a multi objective 
optimization problem that must satisfy multiple constraints. In this work, we are interested in the static 
camera network placement problem, where the objective is to determine the number of cameras, their 
positions and poses for an rectangle surveillance area, given the intrinsic parameters of the cameras 
(such as focal length, the diameter of the lens's aperture a , the minimum dimension of a camera pixel c , 
the resolution R, etc.) and a set of task-specific constraints (such as the resolution constraints, the focus 
constraints and visibility constraints etc.). Commonly, this camera network placement problem takes 
place off-line to support the task-specific requirements of on-line computer vision surveillance 
systems. But occasionally we shall adjust the layout of the camera network on-line to support the 
different surveillance tasks. 

In the real world circumstances, we often layout some linked cameras to surveillance a square 
which is the main subject of this article. We establish a world frame as that: we model the surveillance 
square as the XOY plane and the upward direction as the Z axis which is shown in Figure 2. 

Figure 2. The coordinate frame we establish to model the surveillance scene where the 
ground plane (the surveillance area) is modeled as the XOY plane and the anti-gravity 
direction is modeled as the Z direction. 




Sensors 2014, 14 



1993 



3.1. Camera Modeling 

We assume that the various cameras used in the layout share the same intrinsic parameters, such as 
the resolution of the camera is R(R h ,R v ), the horizontal and vertical dimensions of the size of the 
Coupled Charge Detector (CCD) element is s(h, w) , the focal length of the camera is / , etc. In our 
work we are mainly interested in the video resolution constraints, which is very common in 
surveillance tasks such that we can identify some persons in the surveillance video, and the focus 
constraints which are necessary to get a clear surveillance video. 

3.1.1. Modeling a Camera's FOV (Field of View) in 3D Space 

We use a pyramid to represent the camera's FOV, which is shown in Figure 3. We are interested in 
the homogeneous camera network placement problem in this article. We set K as the intrinsic 
parameter matrix of the cameras in the network. The camera's position C(x c , y c ,z c ) and rotation 

R{(f>,6,y/) describe the camera's extrinsic parameters where R((j),6,y/) is the Euler angles 
representation of the rotation matrix R [28] and ((f), 9, ys) represents the camera's yaw, pitch and roll 

angles respectively (Figure 3) and we have: 



R = 



siny/ -cosy/ 0 
cosy siny/ 0 



0 cos 6 sin 0 
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Figure 3. The FOV of the camera in 3D space and the camera's yaw, pitch, roll angles 
which is shown as (<f>, 0, y/) . The vector N is the intersect of the XOY ccmera plane and the 

XOY world t(C) (the coordinates that transform the world coordinate to the point C ) plane. 
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For the surveillance application, we desire that the angle between the object and the direction of the 
camera is less than a constant angle 6o (which is critical for feature point extraction and the other 
applications), such as 60°. We assume that the object in the surveillance video is vertical to the ground 
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(which is almost true because Pisa tower is seldom), then we get a constraints on the ((j),0,y/) 



that 0<0 O . As Figure 3 shows, the surveillance area of the camera is ABCD , and the coordinates of 
the 4 vertexes can be determined by: 
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where is the CCD height and width, K is the intrinsic matrix of the camera and (jc f , 1 < i <4 



is the coordinates of the vertexes of ABCD . We can transfer the above equation to a more simple one: 
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(3) 



If we know that a camera's configuration (R -C) , we can get the coordinates of the four vertices 
of the quadrangular surveillance area in the plane. Then we can determine whether the point P in the 
plane can be covered by the configuration use P e ABCD . 

3.1.2. Modeling the Resolution Constraints in 3D Space 

In this section we describe the relationship between the surveillance video resolution constraints 
and the camera's position. For a specific surveillance camera, the required resolution provides an 
upper bound on the distance between the camera and the surveillance area. Figure 4 illustrates the 
image process of an object S lying at distance D from the lens center, where the distance between the 
image and the lens center is d . There is a relationship between d , D and the lens focal length / by 

the Gaussian lens equation: 



d + D~J 



(4) 
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Figure 4. Camera and spatial resolution for the camera. 




The surveillance video resolution of object S, described as the pixels per unit length, depends on the 
direction in which it is measured. It is easy to see that the maximum resolution occurs along the rows 
or columns of pixels and the minimum occurs along the diagonal of each pixel. In consideration of 
simplicity, we assume that the desired resolution constraints is a pixels per unit length corresponding 
to the diagonal of each pixel, then: 



, I, h - 2 / w. 2 . 
«(J(^r) +(tt) ) 



R. 



(5) 



D 



Max 



The distance between the position of the camera len's center and the surveillance area D 
must satisfy: 



D < D 



(6) 



3.1.3. Modeling the Focus Constraints in 3D Space 



In this section, we determine the constraints on the camera's viewpoints when we request that all 
the points of the surveillance area must be sharp (in focus) in some surveillance camera's FOV. For 
any camera, there is only one plane on which the camera can precisely focus, and the point object in 
any other plane is imaged as a disk (known as the blur spot) rather than a point. When the diameter of 
blur spot is sufficiently small, the image disk is indistinguishable from a point. The diameter of the 
blur spot is known as the acceptable circle of confusion, or simply as the circle of confusion (CoC). 
We can easily induce that there is a region of acceptable sharpness between two planes on either side 
of the plane of focus which is illustrated in Figure 5. The region is known as depth of field (DOF). 
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Figure 5. DOF for the camera. 




Here we assume the CoC is the minimum dimension of a camera pixel c , The focus distance is D , 
the lens's focal length / , the diameter of the lens's aperture a , relative aperture (f-number) of lens 
N. We can determine the maximum distance D Max , and the minimum distance D Min as follows [29]: 
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If we want the video to be sharp, then: 



D Min <D<D Max 



Then we have that the height of the camera must satisfy (illustrated in Figure 6): 



D Min Cos0 Q <H<D Max 



(7) 
(8) 
(9) 

(10) 
(11) 



3.2. Space Modeling 



In theory, cameras can be located anywhere in the space since the camera position variables 
x c ,y c ,z c and the pose variables <p,6 and y/ are all continuous variables. In practice, we ordinarily 

restrict the selection of the cameras' location and pose in a discrete parameter space which is 
determined by the spatial sampling in the continuous parameter space. So we can transform the 
continuous optimization problem to a discrete optimization problem. We should state that as the spatial 
sampling frequencies f x ,f y ,f z ,f p ,f 9 and f v — >oo, the approximated discrete solution converges to 

the continuous-case solution. 
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Figure 6. The feasible height of the cameras. We can see that the point P will be not sharp 
if the height of the camera's height H don't satisfy D Min Cos6> 0 <H< D Max . That is to say 

that D Min Cos6> 0 < H < D Max is a necessary condition for the point P in sharp. 0 Q is the 

biggest feasible angle between the direction of the camera and the object which is 
discussed above. 
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In the real situation, there are some other different constraints on the allowed deployable area such 
as the constraints that the cameras must be located on the walls of the indoor surveillance environment 
or the constraints that the cameras must be located at some restricted height for the consideration of 
their management, but in this article we only restrict the position of the cameras according to the 
constraints of surveillance video resolution, FOV (field of view) and DOF (depth of field), which are 
described in the previous section. 

3.3. Camera Network Placement Problem Modeling 

We can model the camera network placement as an optimization problem which is defined as 
maximization of some utility function given some cameras and task constraints. From the definition of 
the model, we can analyze the problem from four perspectives: the utility function (or cost function), 
the task constraints, the space constraints and the camera's parameters. Let T be the given task and let 
Ct be the set of all constraints imposed by the task T as: quality of service constraints like resolution 
constraints and focus constraints, square coverage constraints, camera network overlap ration 
constraints etc. 

Let the vector C(C',C E ) represents the camera's intrinsic and extrinsic (position and pose) 
parameters. The problem is to find where to layout the set of cameras in the specific area A to 
minimize the cost function under the set of constraints C T : 

argmin c G(C) where CeAr\C T (12) 

where C = (C r i = 1,2,..., N) is the cameras we want to place in the surveillance area A, GiC) is the 
cost function. 

In this work we are mainly interested in the 2D plane surveillance by a camera network located in a 
3D environment. In this special surveillance scene, we want to place the minimum number of cameras 
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in the optimal locations (position and pose) to maximize the visual coverage under the surveillance 
task constraints. Based on the assumptions we make above, we give the following specific model to 
solve this visual sensor placement. 

Suppose that we have N cameras C = (C r i = 1,2,..., N c ) in the network, and the positions and poses 
that we can place our cameras are S = (S { ,i = 1,2,..., N s ) (which are determined by the space modeling 
we discuss in the section above), and the surveillance area A is modeled as a surveillance space 
T = (J n i = 1,2,..., N T ) . The surveillance space has different forms according to the different tasks. 
In the situation we are interested in this article, we reduced the area to a rectangular grid and choose 
the middle point of each grid (T n i = \,2,...,N T ) as the representation of the surveillance area. 

We define a binary variable \b ij :i = \,...,N s ,j = \,..,N c } to represent the camera network 
configuration as follows: 

b i - = \ if a camera C ■ is placed in a location S ; 

b i ( = 0 otherwise 

We define a binary variable . :i = l,...,N T ,j =1,...,N C } to represent the surveillance utility of 
the camera network. 

x u = 1 if a camera C. can see the object T i 
x. . = 0 otherwise 

Based on the notations we define above, we define the utility function GiC) as: 

G(C) = ju(C) + Af(N c ) (13) 

where ju(C) is the surveillance utility function of the camera network which is increase as the 
number of the cameras increase. In our experiment, we set ju(C) = p where p is the ratio of the 

surveillance area: 

Z max M, ; > (14) 



p = -^- 



N T 



When the number of grids in the surveillance area is too large, then we can apply a sample method 
to determine the coverage. When we choose m grids to check if they are covered by the camera 
network and k grids is covered then we can see that the coverage rate is: 

p = - (15) 
m 

f(N c ) is a regulation item which decreases as the number of the cameras increases, so it can be a 
punitive item to the number of the cameras. In our experiment, we set f(N c ) = — !— . 

Based on the augment above, we turn the camera network placement problem to an optimization 
problem as follows: 

Nj 

Z max My) j (16) 

argmax r (^-^ h/t — ) 

B c N T N c 

Satisfies: 
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V c _ b i < 1 one location only has one camera 



J].^ j < 1 a camera can only be placed in one state (position and pose) 



£max,(^) 

;_i 





4. Optimization Method 

Evolutionary computation technique, motivated by the evolution of Nature, is a powerful tool 
to approximately solve many NP-hard problems. There are many kind of evolution computing 
methods, such as genetic algorithm (GA) [30], differential evolution (DE) [31], artificial bee colony 
(ABC) [32,33]. The differences between different evolution algorithms mainly result from the different 
observations on Nature and the methods used to get new solutions from the old hypothesis. Among the 
various evolutional computing methods, Particle Swarm Optimization (PSO) [34] inspired by the 
manner that a flock of birds or fishes exhibit a coordinated collective behavior during the travel, is an 
optimization tool used to deal with various optimization problems [13,35]. 

The PSO algorithm consists of a population of agents called particles, each of which is a potential 
solution to the optimization problem. The particle i , has a memory to record the current (time t ) 
position (solution) x\ , the previous personal best position pb' and a velocity v\ at which speed the 
particle fly to the next position. We assume that the global best position of the group of particles is 
gb' until time t . The position and velocity of each particle in standard PSO algorithm are updated as 
the followed equations: 



where co is the inertia weight, (p x and (p 2 are the weights of the personal influence and the society 
influence, U[ and U' 2 are two random numbers uniformly distributed in the interval [0,1] . There are 
many variants of the standard PSO algorithm, and the main differences between them lies in the 
velocity update strategy used. 

Despite the different form of the various PSO algorithms that can be used, the main task during the 
solving process is to determine the representation of the particles and the fitness of each particle. That 
is to say, to solve the camera network placement with the PSO approach, we should build a mapping 
between the camera network deployment solution to the particle state and calculate the fitness of each 
particle's state. Because we represent the solution space in a discrete space, we must have a 
mechanism to transform the new position of the particle to a legal place. Like any optimization 
method, the initialization is very important for the convergence speed and solution quality. We will 
illustrate the important factors above in the next sections. 




(17) 



inertia personal society 



v\ = cwl-' + KUKpb'-xD + p^igb' -x\) 



(18) 
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4.1. Representation 

In this section, we describe the state representation of the camera network placement problem. Each 
of the PSO particles is represented as a f x x f y x f z x / x f 0 x f dimensional array, where each 

column is a binary value representing the position and the orientation of the camera to be deployed. 
From the definition, we know that the number of 1 in the array represent the number of the cameras 
deployed in the network. The PSO population of the particles can be set to a const, such as 10,000. 

Next we show how to represent a camera network placement problem as an example. Suppose that 
we deploy four cameras to surveil a rectangular area and the cameras can be placed in a 

Position Orientation 

2x2x1x2x2x1 grid m the 3D space where f z = 1 represents that we set the cameras in the same 
height above the surveillance plane in the example and f ¥ = \ represents that we neglect the influence 

of the rotation of the cameras across the axis Z in the example. We can simplify the particle state 

Position Orientation 

space to a 2x2x 2x2 space. 

Figure 7 shows a mapping between one camera network placement instance to a particle's state in 
the PSO state space where each tuple represent the corresponding position and pose of a deployed 
camera on the 3D grid. For example, the camera state (0,0,0,1) can be transferred to the position 
0 x 2 3 + 0 x 2 2 + 0 x 2 1 + 1 x 2° = 1 of 1 in the particle space, and the other camera state can also be 
transformed to the position of 1 in the particle state. 

Figure 7. A particle state representation for a camera network placement example where 
we can see that the number of 1 in the particle state is 4. 



(0, 0, 0, 1), (0, 1, 0, 1), (1, 0, 0, 0), (1, 1, 0, 1) 
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4.2. Fitness Assignment 

The fitness of each particle is calculated according to the cost function G(C) of the camera network 
placement problem. At first we should transfer the particle's state to a camera network placement 
solution which is a inverse problem of the particle representation which is shown in Figure 8. 
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Figure 8. A camera network placement strategy for a particle state which is a inverse 
process of the particle representation. 
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Then we can get the fitness of the camera network configuration. 

4.3. Flowchart of the Proposed PI-BP SO Algorithms 

The flows of most PSO algorithms are very similar, that is a four step iterations of "initialization, 
evaluate, update, stop". The main difference between them is the strategy of new particles 
generation and the update strategy. In this work, we propose a probability induced binary particle 
swarm optimization algorithm (PI-BPSO) which is a extension of the algorithm propose in [13]. 
In PI-BPSO, the bit value of each particle's state is determined by the "or" value of its current value 
(current state xy) and the probability of being "1" (velocity v,y) which is updated according to the 
information sharing mechanism of PSO. The main steps of PI-BPSO can be described as follows 
(see Figure 9). 

Step 1: Initialization. Setting the size of the population of the PSO and create the population of 
particles which is encoded as the position and orientation of the cameras in the network. 

Step 2: Calculation. Calculate each particle i 's best state pb\ and the global best state gb' of 
particles until time t : 

pb\ = argrnin, G(Q (19) 

gb' =axgmax i pb' (20) 

Step 3: New particle generation. Generate the new position of the particle i according to the 
probability P' , P' step by step. 
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Figure 9. PI-BPSO flowchart. We should point out that we should calculate both the 
personal best and global best of the particles P'. , P'. to compute the velocity of 
the particle. 
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First we calculate the probability P' pi , P' gi as follows: 



PL 



1 



Pi, 



pi i+(g(c;.)-g(c;» 
i 



l + (G(C')-G(C')) 



(21) 



(22) 



where G(C' pi ) is the utility when the particle i in its best position until time t, G(C/) is the utility 
when the particle i in time t , G(C' g ) is the maximum utility of all the particles until time t. 

Secondly, we generate two uniformly distributed random numbers rL^ and compare them with 
the probability P' pj , P g \ to determine the personal velocity and society velocity as follows: 



Sensors 2014, 14 



2003 






I... otherwise 



I... otherwise 



(23) 



(24) 



Thirdly, we determine the velocity at time t : 

a = <p 1 v?/ + <p 2 v l 



'ff where q> x + (p 2 = 1 



(25) 





1... otherwise 



(26) 



Fourthly, we determine the new state of the particle i : x'^ 1 = x\- © v y , where © is a logical 
operator we define as follows: If v?. =\,x , ~ l = x' jj else x' +1 = not(Xy) which is different from [13] 

where the operator is "OR". In the work of [13], when one position is assigned to be "TRUE", then it 

will almost always be assigned a camera there, we think this is unreasonable in some sense. We should 
explain one case clearly. If x ;> = 1 for all j , the fitness of the configuration will not be the best 

particle candidate (because the regulation item in the utility function) and the velocity will have great 
probability to be zero. Considering the update strategy in our algorithm, some position of the particle 
will be set to "0" with great probability. So the optimization process will not halt. 

Step 4: Particle fitness computation. Compute the fitness of the particles according to its new 
positions x\ . 

Step 5: Decide whether to update or stop according to the terminate criteria. Determine whether 
the terminate criteria is satisfied (the upper limit of the iteration times or the accuracy precision). If the 
terminate criteria are satisfied, then output the global best particle, else update the new personal best 
solution of each particle and the global best particle and goto the next iteration. 

In the algorithm aspect, the main difference between our algorithm and [13] is that we distinguish 
the person influence and the society influence which are very important for the success of the PSO 
algorithms while [13] only gives a update strategy which ignores the two influence factors. 

There is another key difference between the two methods: PI-BPSO uses a regulation item in the 
cost function to determine the appropriate number of cameras while BPSO-PI [13] doesn't consider the 
issue in the article. The benefit of involving the number of cameras in the cost function is evident that 
is when the benefit of increasing of the number of cameras can't cover the cost of the increase of the 
number of the cameras, we should stop increasing the number of cameras. We think that this intuitive 
idea will help reach get the camera network placement solution more easily. 

5. Simulation Results 

Most of the constrained optimization problems solved by PSO [13,36] apply some kind of 
feasibility preserving strategy to handle the constraints, which means that we only preserve the 
particles that satisfy all the constraints mentioned above. Through the strategy we transfer the 
constrained optimization problem to an unconstrained ones. 
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For the problems in this article, we apply the following strategy: 

(1) During the initialization stage, all particles are started with feasible solutions; 

(2) During the updating stage, only the feasible particles are kept in their memories. 

To evaluate the effectiveness and reliability of the proposed algorithm, we adopted the following 
experiment configuration: 

(1) The surveillance area is a 50 m x 50 m square; 

"4.8 0 0" 

(2) The camera intrinsic parameter is K= 0 4.8 0 

0 0 1 

(3) The minimum resolution requirement is a = 1 pixel/cm (for face recognition, a face is imaged 
as a 20 x 20 square); 

(4) The CCD size is 1 / 4" (3.2 mm x 2.4 mm) and the resolution of the camera is 1,024 x 768 ; 

2^r 3^r 

(5) The Euler angle of the camera is restricted in the following range: — < 9 < — . 

We do two kinds of experiments. One kind of experiment is to determine the influence of the 
different sampling frequencies on the solution of the camera network deployment problem; the 
second kind of the experiment is to determine the influence of the X parameters on the cost 
functions. We set the population size at 20 and the maximum number of iterations in the experiments 
is 10,000. 

We present some results obtained by the algorithms (Figures 10-12). In the figures, blue bold line 
represents the area that needs to be surveilled and the red dash lines represent the FOV of the 
cameras. In Figures 11 and 12, we choose the same surveillance area and choose different sample 
frequencies for the pose parameters. In Figure 11, the parameters is ( f x = f = f = f =4 , 
/ = f e = 2 ), the fitness of the surveillance is 0.9785, the number of the surveillance cameras is 24; 
while in Figure 12, the parameters is ( f x — f — f z — f — f - f e — 4 ), the fitness of the surveillance 
grows to 0.9908 while the number of the surveillance cameras is decreased to 16. From the 
experiments, we know that we can get more accurate results when we use a bigger sample space. In 
Figure 11, the sample space is 4,096 which is too big for classical Binary Integer Linear 
programming optimizations. 

In the utility function, we set a regulization item which is a penalty for the camera number. We do 
an experiment to determine the effect of the X parameter. In the next experiment we choose the 
task parameter as follows: surveillance area is 50 m x 50 m and the sample frequency is 

fx f y f z f <p f y/ f 0 ^ * 

In Figure 13, the regulization parameter A = 2. In Figure 14, the regulization parameter X = 0.1 . In 
Figure 15, the regulization parameter X = 0.01. We can see that the regulization parameter takes 
effects to control the number of the cameras. 
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Figure 10. Optimal placement of cameras (six cameras). The surveillance area is 10 m x 10 m, 

A =4, /,=4,/ r =2,/,=4,/,=2,/ r =4,A = l. 
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Figure 11. Optimal placement of cameras (24 cameras). The surveillance area is 
50mx50m, f x =4,f y =4,f t = 2,/„ =4,/, = 2,f ¥ =4,4 = 1. 
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Figure 12. Optimal placement of cameras (16 cameras). The surveillance area is 50 m x 50 m, 
/,=4,/,=4,/ =4,/ =4,/,=4,/ =4, X = l. 
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Figure 13. Optimal placement of cameras (14 cameras). The ssurveillance area is 50 m x 50 m, 
/,=4,/,=4,/ f =4,/,=4,/,=4,/ r =4, 1 = 2. 
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Figure 14. Optimal placement of cameras (18 cameras). The surveillance area is 50 m x 50 
/,=4./ =4,/ z =4,/ =4,/,=4,/ =4,A = 0.1. 



m. 




Figure 15. Optimal placement of cameras (20 cameras). The surveillance area is 50 m x 50 m, 
/,=4,/,=4,/ 4 =4,/ r =4,/,=4,/ r =4,A = 0.01. 
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We do an experiment to compare PI-BPSO with BPSO-IP, GA and ABC from three perspectives: 
iteration times, fitness and computation time. We repeat the experiment 50 times and the value of the 
iteration times, fitness and computation time are the average values of the 50 tests. Because the codes 
of the other methods are all unlnown, the experimental results may not be same as the original author 
stated. We show the result in Figure 16. 
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Figure 16. The comparison result of different optimization method, (a) The iteration 
times comparison result; (b) The fitness comparison result; (c) The computation time 
comparison result. 



nerati or times 




(c) 

We do an experiment with real cameras in a hall building with the simulation result that we have 
obtained from the simulation. The surveillance area is a basketball court and a man is roller skating. 
We determine the position and pose of the surveillence cameras using the PI-BPSO algorithm. The 
experimental result shows the effect of the algorithm is that the video is in focus and the people in the 
video can be identified easily. We show the result in Figure 17. 
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Figure 17. Some images from the real surveillence video to validate the effectiveness of 
the algorithm, (a) The video is in focus; (b) The resolution constraints are satisfied and the 
person can be identified in the video. 




(a) (b) 

6. Conclusions 

In this paper, we discuss the automatic camera network placement problem which is solved by an 
evolution-like method, PI-BPSO. The different simulation results show the effectiveness of the 
proposed algorithm. The algorithm is guaranteed to get a global optimum with high probability due to 
two reasons: 

(1) The initialization and the update process are both determined randomly which assures that we 
will now land in the local minimum with high probability; 

(2) The introduction of the regulation item eases the optimization process and allows us to 
optimize the number of the cameras and the configuration of the cameras at the same time. 

In the future, we will continue our study on the camera network placement problem from several 
directions, such as a more accurate initialization or integration of the PSO and other optimization 
methods to speed up the convergence. 
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